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AUTOMATED SET UP OF WEB-BASED [CONVERSATIONAL] 
NATURAL LANGUAGE INTERFACE 

In the Specification : 

Please amend the specification as follows. A clean copy of the amended 
paragraphs is attached. 

Please amend the paragraph on page 1, lines 14 to 16, as follows: 

The present invention generally relates to natural language systems and, 
more particularly, to an automated method for setting up a Web-based 
[conversational] natural language interface. 

Please amend the paragraph beginning on page 1, line 18, and continuing 
to page 2, line 2, as follows: 

The World Wide Web (WWW) portion of the Internet has seen an 
explosion of Web sites for various individual and business purposes. This in tum 
has led to a growing industry in Do It Yourself (DIY) software and Web design 
services to assist those who want to set up a Web site. 

Please amend the paragraph on page 2, Hnes 1 1 to 13, as follows: 

It is therefore an object of the present invention to provide a procedure that 
automates the process of setting up an instance of a [conversational] natural 
language interface for a Web site. 

Please amend the paragraph on page 2, lines 16 to 22, as follows: 

This invention, by automating the process of setting up a new Web site, 
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enables a new interface to be created by anyone. Subsequent manual tuning of the 
interface is possible and much easier to do than creating an interface from scratch. 
The invention solves the problem by bringing together a number of ideas and 
techniques, some of which have been used in natural language processing for other 
purposes. In order to set up an instance of a natural language [conversational] 
interface [(hereinafter NLCI)], it is necessary to 

Please amend the paragraph beginning on page 4, line 16, and continuing 
to page 5, line 17, as follows: 

Referring now to the drawings, and more particularly to Figure 1 , there is 
shown a flow diagram of the automated set up procedure. A program 
implementing a Web crawler is invoked in function block 11, beginning at the 
home page of the site for which a natural language interface is to be generated. 
The output of this module is a file of Web pages in HyperText Markup Language 
(HTML), hi function block 12, the Uniform Resource Locators (URLs) of the 
Web pages are processed to induce a hierarchy of topics for the site and the 
HTML formatted pages are converted to the appropriate standard format. In a 
preferred implementation of the invention, the standard format is [extended] 
extensible Markup Language (XML). In function block 13, sparse n-grams are 
extracted from each page to serve as index terms for the page. The index terms are 
used to set up an answer generator (search engine) for the page in function block 
14. In function block 15, a set of sparse n-grams is generated for each of the topics 
found in function block 12 by grouping together all the documents having that 
topic. Those n-grams satisfying some criterion for significant association with the 
topic are saved. In a preferred implementation of the invention, the criterion used 
is the chi-square measure. Optionally, another statistical test can be made to 
associate a confidence measure with each rule. In the preferred implementation of 
the invention, the confidence measure is the percentage of time the underlying n- 
gram occurs in the topic. [The] Each sparse [n-grams are] n-gram is converted to 



